Multiscale Geometric Methods for Data Sets II: Geometric Multi-Resolution Analysis
نویسندگان
چکیده
Data sets are often modeled as point clouds in R, for D large. It is often assumed that the data has some interesting low-dimensional structure, for example that of a d-dimensional manifold M, with d much smaller than D. When M is simply a linear subspace, one may exploit this assumption for encoding efficiently the data by projecting onto a dictionary of d vectors in R (for example found by SVD), at a cost (n + D)d for n data points. When M is nonlinear, there are no “explicit” constructions of dictionaries that achieve a similar efficiency: typically one uses either random dictionaries, or dictionaries obtained by black-box optimization. In this paper we construct data-dependent multi-scale dictionaries that aim at efficient encoding and manipulating of the data. Their construction is fast, and so are the algorithms that map data points to dictionary coefficients and vice versa. In addition, data points are guaranteed to have a sparse representation in terms of the dictionary. We think of dictionaries as the analogue of wavelets, but for approximating point clouds rather than functions.
منابع مشابه
Multi-Resolution Geometric Analysis for Data in High Dimensions
Large data sets arise in a wide variety of applications and are often modeled as samples from a probability distribution in high-dimensional space. It is sometimes assumed that the support of such probability distribution is well approximated by a set of low intrinsic dimension, perhaps even a lowdimensional smooth manifold. Samples are often corrupted by high-dimensional noise. We are interest...
متن کاملSome recent advances in multiscale geometric analysis of point clouds
We discuss recent work based on multiscale geometric analysis for the study of large data sets that lie in high-dimensional spaces but have low-dimensional structure. We present three applications: the first one to the estimation of intrinsic dimension of sampled manifolds, the second one to the construction of multiscale dictionaries, called geometric wavelets, for the analysis of point clouds...
متن کاملHesitant q-rung orthopair fuzzy aggregation operators with their applications in multi-criteria decision making
The aim of this manuscript is to present a new concept of hesitant q-rung orthopair fuzzy sets (Hq-ROFSs) by combining the concept of the q-ROFSs as well as Hesitant fuzzy sets. The proposed concept is the generalization of the fuzzy sets, intuitionistic fuzzy sets, hesitant fuzzy sets, and Pythagorean fuzzy sets as well as intuitionistic hesitant fuzzy sets (IHFSs) and hesitant Pythagorean fuz...
متن کاملEffect of Phase-Encoding Reduction on Geometric Distortion and BOLD Signal Changes in fMRI
Introduction Echo-planar imaging (EPI) is a group of fast data acquisition methods commonly used in fMRI studies. It acquires multiple image lines in k-space after a single excitation, which leads to a very short scan time. A well-known problem with EPI is that it is more sensitive to distortions due to the used encoding scheme. Source of distortion is inhomogeneity in the static B0 field that ...
متن کاملMultiscale Geometric Methods for Data Sets I: Multiscale SVD, Noise and Curvature
Large data sets are often modeled as being noisy samples from probability distributions μ in R, withD large. It has been noticed that oftentimes the supportM of these probability distributions seems to be well-approximated by low-dimensional sets, perhaps even by manifolds. We shall consider sets that are locally well approximated by k-dimensional planes, with k ≪ D, with k-dimensional manifold...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2011